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DETAILED ACTION 



Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 



1. Claims 1-6, 8, 9, 14-18, 20, 21, 26-29, 35 and 36 are rejected under 35 U.S.C. 
103(a) as being unpatentable over Almeida EP 0942575 in view of Newman et al. US 
5,946,654. 



1.1 Regarding claim 1 , Almeida discloses a system using the Hidden Markov Models 
(HMM) for speech recognition (paragraph 0053), and a method or identifying a caller, 
comprising steps of: 

receiving a voice input from the caller, including a plurality of speech units, i.e. a 
first name and a last name (figure 4, steps 301 and 302, paragraph 0091 ); 

identifying a caller by voice recognition with speaker dependent (previous 
identified caller) recognizer and speaker independent (genetic) recognizer (Abstract); 
using a plurality of speech recognizers, i.e. a speaker dependent recognizer comprising 
a plurality of known caller voice patterns (acoustic models), and the speaker 
independent recognizer (also comprising a plurality of generic voice patterns) for 
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identifying a caller's name, wherein the speaker independent recognizer comprise a 
plurality of speech units, i.e. a first name and a last name of each caller inputted by a 
user (figure 4; steps 305-309; paragraphs 0056, 0089, 0093 and 0094); 

identifying the caller as an previous identified caller or a new caller based on the 
match (score) of the speaker dependent recognizer and the speaker independent 
recognizer (paragraph 0093 and 0094); and 

if the caller is identified as a new caller, generating a new speech sample (new 
model) for a new caller (paragraph 0094). 

Almeida teaches generating new caller (a first time caller who's name is on a 
user's list per paragraph 0089) speech models (caller name patterns) by a name- 
adaptive trainer. Almeida fails to teach that the new caller speech models are 
generated by modifying speaker independent models (first name and last name inputted 
by the user in paragraph 0089). 

However, Newman discloses a speaker identification system and teaches that a 
speaker dependent model is produced by modifying a speaker independent model 
based on training data generated from speech of a known speaker (column 7, lines 40- 
43). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to further modify the Almeida's reference with the teaching 
of Newman so that new caller name patterns (acoustic models) would have been 
generated by modifying speaker independent models that comprising same speech 
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units (a first name and a last name), because such a modification would have clarified 
how the new caller models of Almeida were generated by the name-adaptive trainer. 

1 .2 Regarding claim 2, it is obvious that if a caller is an identified caller, speaker 
dependent recognizer will produce a better match score than the speaker independent 
recognize, and Almeida teaches that if a caller is identified by a speaker independent 
recognizer, but not by the speaker dependent recognizers, which is also obvious that in 
this case the speaker independent recognizer has a better match score. 

1 .3 Regarding claim 3, as discussed in above, Almeida also teaches using HMM 
which segments a speech input into phonetic sequence (paragraph 0053). 

1 .4 Regarding claims 4 and 5, Almeida teaches updating identified caller speech 
models (figure 4, steps 306, 309; paragraph 0093). 

1 .5 Regarding claim 6, Almeida teaches collecting a voice sample (pattern or model) 
of a new caller (figure 4, steps 308 and 309; paragraph 0094). 

1 .6 Regarding claim 8, Almeida teaches that the caller is not being alerted while 
performing speech recognition (figure 4, steps 302-309). 
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1 .7 Regarding claim 9, as discussed in above, Almeida also teaches using HMM 
which segments a speech input into phonetic sequence (paragraph 0053). 

1 .8 Regarding claims 14 and 26, Almeida discloses a system using the Hidden 
Markov Models (HMM) for speech recognition (paragraph 0053), comprising: 

a incoming voice recorder for receiving a voice input from the caller, including a 
first name and a last name (figure 2, steps 301 and 302; paragraphs 0066, 0091); 

memories 120 and 125 for storing speaker dependent (previous identified caller) 
speech patterns (acoustic models) and speaker independent speech patterns (genetic 
models), wherein speech patterns includes a first name and last name (figure 2; 
paragraphs 0075, 0082 and 0089); 

parameters analyzers 118 and 123 for applying characteristics of the voice input 
to the speech patterns to produce a plurality of scores (figure 4; steps paragraphs 
0091-0094, 0056); 

name identifiers 122 and 127 for identifying the caller as an previous identified 
caller or a new caller based on the match (score) of the speaker dependent recognizer 
and the speaker independent recognizer (paragraph 0093 and 0094); and 

if the caller is identified as a new caller, using a speech pattern generator 
generating new speech samples (new models) comprising a first name and a last name 
(same speech units for a new caller) (figure 4, step 309; paragraph 0094). 

Almeida teaches generating new caller (a first time caller who's name is on a 
user's list per paragraph 0089) speech models (caller name patterns) by a name- 
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adaptive trainer. Almeida fails to teach that the new caller speech models are 
generated by modifying speaker independent models (first name and last name inputted 
by the user in paragraph 0089). 

However, Newman discloses a speaker identification system and teaches that a 
speaker dependent model is produced by modifying a speaker independent model 
based on training data generated from speech of a known speaker (column 7, lines 40- 
43). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to further modify the Almeida's reference with the teaching 
of Newman so that new caller name patterns (acoustic models) would have been 
generated by modifying speaker independent models that comprising same speech 
units (a first name and a last name), because such a modification would have clarified 
how the new caller models of Almeida were generated by the name-adaptive trainer. 

1 .9 Regarding claims 1 5 and 27, Almeida also teaches using HMM which segments 
a voice input into phonetic sequence for speech recognition (paragraph 0053). 

1.10 Regarding claims 16, 17 and 28, Almeida teaches updating identified caller 
speech models (figure 4, steps 306, 309; paragraph 0093). 
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1.11 Regarding claims 18 and 29, as discussed above, Almeida teaches that if the 
caller is identified as a new caller, using a speech pattern generator to generate new 
speech samples (new models) for a new caller (figure 4, step 309; paragraph 0094). 

1 .12 Regarding claim 20, Almeida teaches that the caller is not being alerted while 
performing speech recognition (figure 4, steps 302-309). 

1.13 Regarding claim 21 , as discussed in above, Almeida also teaches using HMM 
which segments a speech input into phonetic sequence (paragraph 0053). 

1.14 Regarding claim 35, Almeida discloses a system and method or identifying a 
caller. Almeida teaches: 

receiving a voice input from the caller, including a plurality of speech units, i.e. a 
first name and a last name (figure 4, steps 301 and 302; paragraph 0091); 

identifying a caller by voice recognition with speaker dependent (previous 
identified caller) recognizer and speaker independent (genetic) recognizer (Abstract); 
using a plurality of speech recognizers, i.e. a speaker dependent recognizer comprising 
a plurality of known caller voice patterns (acoustic models), and the speaker 
independent recognizer (also comprising a plurality of generic voice patterns) for 
identifying a caller's first name and last name (figure 4; steps 305-309; paragraphs 
0056, 0089, 0093 and 0094); 
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identifying the caller as an previous identified caller or a new caller based on the 
match (score) of the speaker dependent recognizer and the speaker independent 
recognizer (paragraph 0093 and 0094); and 

if the caller is identified as a new caller, generating new speech samples (new 
models) for a new caller with same plurality of speech units (i.e. a first name and a last 
name) (paragraph 0094). 

Almeida further teaches using the Hidden Markov Models (HMM) for speech 
recognition (paragraph 0053). The HMM inherently segments the voice input into a 
sequence of phonetic units for speech recognition (see US 5,825, 977 issued to Morin 
et al. column 3, lines 36-46, and also US 5,649,056 issued to Nitta, column 4, lines 24- 
29). 

Almeida teaches generating new caller (a first time caller who's name is on a 
user's list per paragraph 0089) speech models (caller name patterns) by a name- 
adaptive trainer. Almeida fails to teach that the new caller speech models are 
generated by modifying speaker independent models (first name and last name inputted 
by the user in paragraph 0089). 

However, Newman discloses a speaker identification system and teaches that a 
speaker dependent model is produced by modifying a speaker independent model 
based on training data generated from speech of a known speaker (column 7, lines 40- 
43). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to further modify the Almeida's reference with the teaching 
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of Newman so that new caller name patterns (acoustic models) would have been 
generated by modifying speaker independent models that comprising same speech 
units (a first name and a last name), because such a modification would have clarified 
how the new caller models of Almeida were generated by the name-adaptive trainer. 

1.15 Regarding claim 36, as discussed above, Almeida teaches that if the caller is 
identified as a new caller, generating new speech samples (new models) for a new 
caller (figure 4, step 309; paragraph 0094). 

2. Claims 10, 1 1 , 22, 23, 31 , 32 and 37 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Almeida EP 0942575 in view of Newman et al. US 5,946,654 
and further in view of Maes US 6,088,669. 

2.1 Regarding claims 10, 22, 31 and 37, the modified Almeida reference teaches 
maintaining caller specific voice patterns (acoustic models) with speech units in 
sequence (first name, last name) in a memory 120, but fails to teach maintaining a 
caller specific language model for each of the previously identified callers. 

However, Maes discloses a method for identifying a speaker, using speaker 
dependent and speaker independent models (column 5, line 61 to column 6, line 13). 
Maes further teaches using different models, such as HMM and special language 
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models specific to previous identified speakers for speech recognition (column 4, lines 
42-49). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to further modify the Almeida's reference with the teaching 
of Maes so that specific language models would have been maintained, because such a 
modification would have improved the accuracy of speech recognition as taught by 
Maes (see Maes, column 4, lines 49-52). 

2.2 Regarding claims 1 1 , 23 and 32, Almeida teaches identifying a caller as a new 
caller (who is a different caller from a previously identified caller) by speaker 
independent recognizer (paragraph 0094), and identifying the new caller by a speech 
unit sequence (first name, last name). 

3. Claims 12, 24, 33 and 38 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Almeida EP 0942575 in view of Newman et al. US 5,946,654 and 
further in view of Beigi et al. US 6,684,186. 

The modified Almeida reference teaches maintaining caller specific voice 
patterns (acoustic models) in a memory 120, but fails to teach merging voice patterns 
using a distance measurement. 

However, Beigi discloses a speaker recognition system in that speech models 
with closest distance are merged (column 4, lines 1-12). 
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Therefore, it would have been obvious to one of ordinary skill in the art at the 
time the invention was made to modify the Almeida's reference with the teaching of 
Beigi so that speech patters with closest distance would have been merged, because 
such a modification would have saved memory space for not storing duplicated speech 
patterns. 

Response to Arguments 

4. Applicant's arguments with respect to claim*** have been considered but are 
moot in view of the new ground(s) of rejection. 



Allowable Subject Matter 

5. Claims 1 3, 25 and 34 are objected to as being dependent upon a rejected base 
claim, but would be allowable if rewritten in independent form including all of the 
limitations of the base claim and any intervening claims. 

The following is a statement of reasons for the indication of allowable subject 
matter: The modified Almeida teaches merging speech models, but fails to teach 
generating a user prompt identifying the merging speech models. Since Almeida 
teaches automatic speech recognition, there is no motivation to modify Almeida's 
reference into a manual operating system. 
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Conclusion 



6. Any inquiry concerning this communication or earlier communication from the 
examiner should be directed to Simon Sing whose telephone number is 571-272-7545. 
The examiner can normally be reached on Monday - Friday from 8:30 AM to 5:30 PM. 
If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Fan Tsang, can be reached at 571-272-7547. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. Any 
inquiry of a general nature or relating to the status of this application or proceeding 
should be directed to the receptionist whose telephone number is 571-272-2600. 
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